RELATING TO BINDING PROTEINS FOR RECOGNITION OF DNA

TTris application is the nalional phase of ^atonal *J ^ <<thete h no prospect of achieving a zinc 

designated the U.S. JJg** only limited ****** 

FIELD OF THE INVENTION

For example, Desjarlais et al., (1992b PNAS

BACKGROUND OF THE INVENTION

g « -i-^j— W SEE 5SK5S - — "»> - - 

^delv used domain within protein ^eUh eve Inez * raore generally applicable 

JSSTSSm in understanding crucial processes such 252, 809-817 Fa*aU , ^ 2fil> 

SKS of gene expression in differentiation and 483-487 , and^ I avUtt ^ is princi- 

gene expression in Drosophila (e.g. Harrison & Items 1987 uipk t ^fcutes o be 

DNA doubU hSy inserting an a-helix into the major J^~f c J of a£268 for example, the protein » 
DNAaounie u j Biochem. 61, positions. in a 9 base pair 

ElcgSJn^S- (London) 353, 715-719; Jtf. JSlJKi ^ A linker ^sequence* 

aSd Klug 993 Gene 135, 83-92). 45 0 ° u Tbemen fingers which appears to make no direct 

radta tom the geometrical and chemical complementarity Km 1 » ^ 

bet^Te amino acid side chains of the a-hehx and the "JJJ^ ^ experiments have shown that 1 is 
Si. groups exposed on the edge. ^ toT rationally to W^^g^SS 

that certain amino acid side chains have presences or / & B 1992 b Proc. Natl. Acad. Sci. USA W, 
studied their binding to randomised oligonucleotides, using electrophoretic mobility shift assays. Subsequent use of phage display technology has permitted the expression of random libraries of Zf proteins on the surface of bacteriophage. The three Zf domains of Zif268, with positions within finger one randomised, have been displayed on the surface of filamentous phage by Rebar and Pabo (1994 Science 263, 671-673). The library was then subjected to rounds of affinity selection by binding to target DNA oligonucleotide sequences in order to obtain Zf proteins with new binding specificities. Randomised mutagenesis (at the same positions as those selected by Rebar & Pabo) of finger 1 of Zif268 with phage display has also been used by Jamieson et al. (1994 Biochemistry 33, 5689-5695) to create novel binding specificity and affinity.

More recently Wu et al. (1995 Proc. Natl. Acad. Sci. USA 92, 344-348) have made three libraries, each of a different finger from Zif268, and each having six or seven α-helical positions randomised. Six triplets were used in selections but did not return fingers with any sequence biases; and when the three triplets of the Zif268 binding site were individually used as controls, the vast majority of selected fingers did not resemble the sequences of the wild-type Zif268 fingers and, though capable of tight binding to their target sites in vitro, were usually not able to discriminate strongly against different triplets. The authors interpret the results as evidence against the existence of a code.

In summary it is known that Zf protein motifs are widespread in DNA binding proteins and that binding is via three key amino acids, each one contacting a single base pair in the target DNA sequence. Motifs are modular and may be linked together to form a set of fingers which recognise a contiguous DNA sequence (e.g. a three fingered protein will recognise a 9 mer etc). The key residues involved in DNA binding have been identified through sequence data and from structural information. Directed and random mutagenesis has confirmed the role of these amino acids in determining specificity and affinity. Phage display has been used to screen for new binding specificities of random mutants of fingers. A recognition code, to aid design of new finger specificities, has been worked towards although it has been suggested that specificity may be difficult to predict.

SUMMARY OF THE INVENTION

In a first aspect the invention provides a library of DNA sequences, each sequence encoding at least one zinc finger binding motif for display on a viral particle, the sequences coding for zinc finger binding motifs having random allocation of amino acids at positions -1, +2, +3, +6 and at least at one of positions +1, +5 and +8.

A zinc finger binding motif is the α-helical structural motif found in zinc finger binding proteins, well known to those skilled in the art. The above numbering is based on the first amino acid in the α-helix of the zinc finger binding motif being position +1. It will be apparent to those skilled in the art that the amino acid residue at position -1 does not, strictly speaking, form part of the α-helix of the zinc binding finger motif. Nevertheless, the residue at -1 is shown to be very important functionally and is therefore considered as part of the binding motif α-helix for the purposes of the present invention.

The sequences may code for zinc finger binding motifs having random allocation at all of positions +1, +5 and +8. The sequences may also be randomised at other positions (e.g. at position +9, although it is generally preferred to retain an arginine or a lysine residue at this position).

Further, whilst allocation of amino acids at the designated "random" positions may be genuinely random, it is preferred to avoid a hydrophobic amino acid (Phe, Trp or Tyr) or a cysteine residue at such positions.

Preferably the zinc finger binding motif is present within the context of other amino acids (which may be present in zinc finger proteins), so as to form a zinc finger (which includes an antiparallel β-sheet). Further, the zinc finger is preferably displayed as part of a zinc finger polypeptide, which polypeptide comprises a plurality of zinc fingers joined by an intervening linker peptide. Typically the library of sequences is such that the zinc finger polypeptide will comprise two or more zinc fingers of defined amino acid sequence (generally the wild type sequence) and one zinc finger having a zinc finger binding motif randomised in the manner defined above. It is preferred that the randomised finger of the polypeptide is positioned between the two or more fingers having defined sequence. The defined fingers will establish the "phase" of binding of the polypeptide to DNA, which helps to increase the binding specificity of the randomised finger.

Preferably the sequences encode the randomised binding motif of the middle finger of the Zif268 polypeptide. Conveniently, the sequences also encode those amino acids N-terminal and C-terminal of the middle finger in wild type Zif268, which encode the first and third zinc fingers respectively. In a particular embodiment, the sequence encodes the whole of the Zif268 polypeptide. Those skilled in the art will appreciate that alterations may also be made to the sequence of the linker peptide and/or the β-sheet of the zinc finger polypeptide.

In a further aspect, the invention provides a library of DNA sequences, each sequence encoding the zinc finger binding motif of at least a middle finger of a zinc finger binding polypeptide for display on a viral particle, the sequences coding for the binding motif having random allocation of amino acids at positions -1, +2, +3 and +6. Conveniently, the zinc finger polypeptide will be Zif268.

Typically, the sequences of either library are such that the zinc finger binding domain can be cloned as a fusion with the minor coat protein (pIII) of bacteriophage fd. Conveniently, the encoded polypeptide includes the tripeptide sequence Met-Ala-Glu as the N terminal of the zinc finger domain, which is known to allow expression and display using the bacteriophage fd system. Desirably the library comprises 10^6 or more different sequences (ideally, as many as is practicable).

In another aspect the invention provides a method of designing a zinc finger polypeptide for binding to a particular target DNA sequence, comprising screening each of a plurality of zinc finger binding motifs against at least an effective portion of the target DNA sequence and selecting those motifs which bind to the target DNA sequence. An effective portion of the target DNA sequence is a sufficient length of DNA to allow binding of the zinc binding motif to the DNA. This is the minimum sequence information (concerning the target DNA sequence) that is required. Desirably at least two, preferably three or more, rounds of screening are performed.

The invention also provides a method of designing a zinc finger polypeptide for binding to a particular target DNA sequence, comprising comparing the binding of each of a plurality of zinc finger binding motifs to one or more DNA triplets, and selecting those motifs exhibiting preferable binding characteristics. Preferably the method defined immediately above is preceded by a screening step according to the method defined in the previous paragraph.
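As an illustration of what random allocation at the designated positions can mean at the DNA level, the following Python sketch enumerates the codon mixtures that Example 1 describes for the middle finger of Zif268: (G,A,C)NN for the randomised positions and A(G,A)G for position +9. It assumes the standard genetic code; the function and variable names are hypothetical and chosen for illustration only, and the sketch is not a description of how the library was actually synthesised.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in T, C, A, G order at every codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def expand(first, second, third):
    """List every codon allowed by a mixture of bases at each position."""
    return ["".join(c) for c in product(first, second, third)]


def encoded(codons):
    """Return the set of one-letter amino acids (or '*' for stop) encoded."""
    return {CODON_TABLE[c] for c in codons}


# (G,A,C)NN: T is omitted at the first base, N is any base.
randomised = expand("GAC", BASES, BASES)
# A(G,A)G: the codon used for position +9.
position_9 = expand("A", "GA", "G")

all_twenty = set("ACDEFGHIKLMNPQRSTVWY")
print(len(randomised), "codons per randomised position")
print("stop codons present:", "*" in encoded(randomised))
print("amino acids not represented:", sorted(all_twenty - encoded(randomised)))
print("position +9 encodes:", sorted(encoded(position_9)))
```

Under these assumptions the mixture contains no stop codon and lacks exactly the four amino acids whose codons all begin with T, which is consistent with the stated preference to avoid Phe, Trp, Tyr and Cys at the randomised positions.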
It is thus preferred that there is a two-step selection procedure: the first step comprising screening each of a plurality of zinc finger binding motifs (typically in the form of a display library), mainly or wholly on the basis of affinity for the target sequence; the second step comprising comparing binding characteristics of those motifs selected by the initial screening step, and selecting those having preferable binding characteristics for a particular DNA triplet.

Where the plurality of zinc finger binding motifs is screened against a single DNA triplet, it is preferred that the triplet is represented in the target DNA sequence at the appropriate position. However, it is also desirable to compare the binding of the plurality of zinc binding motifs to one or more DNA triplets not represented in the target DNA sequence (e.g. differing by just one of the three base pairs) in order to compare the specificity of binding of the various binding motifs. The plurality of zinc finger binding motifs may be screened against all 64 possible permutations of 3 DNA bases.

Once suitable zinc finger binding motifs have been identified and obtained, they will advantageously be combined in a single zinc finger polypeptide. Typically this will be accomplished by use of recombinant DNA technology; conveniently a phage display system may be used.

In another aspect, the invention provides a DNA library consisting of 64 sequences, each sequence comprising a different one of the 64 possible permutations of three DNA bases in a form suitable for use in the selection method defined above. Desirably the sequences are associated, or capable of being associated, with separation means. Advantageously, the separation means is selected from one of the following: microtitre plate; magnetic beads; or affinity chromatography column. Conveniently the sequences are biotinylated. Preferably the sequences are contained within 12 mini-libraries, as explained elsewhere.

In a further aspect the invention provides a zinc finger polypeptide designed by one or both of the methods defined above. Preferably the zinc finger polypeptide designed by the method comprises a combination of a plurality of zinc fingers (adjacent zinc fingers being joined by an intervening linker peptide), each finger comprising a zinc finger binding motif. Desirably, each zinc finger binding motif in the zinc finger polypeptide has been selected for preferable binding characteristics by the method defined above. The intervening linker peptide may be the same between each adjacent zinc finger or, alternatively, the same zinc finger polypeptide may contain a number of different linker peptides. The intervening linker peptide may be one that is present in naturally-occurring zinc finger polypeptides or may be an artificial sequence. In particular, the sequence of the intervening linker peptide may be varied, for example, to optimise binding of the zinc finger polypeptide to the target sequence.

Where the zinc finger polypeptide comprises a plurality of zinc binding motifs, it is preferred that each motif binds to those DNA triplets which represent contiguous or substantially contiguous DNA in the sequence of interest. Where several candidate binding motifs or candidate combinations of motifs exist, these may be screened against the actual target sequence to determine the optimum composition of the polypeptide. Competitor DNA may be included in the screening assay for comparison, as described below.

The non-specific component of all protein-DNA interactions, which includes contacts to the sugar-phosphate backbone as well as ambiguous contacts to base-pairs, is a considerable driving force towards complex formation and can result in the selection of DNA-binding proteins with reasonable affinity but without specificity for a given DNA sequence. Therefore, in order to minimise these non-specific interactions when designing a polypeptide, selections should preferably be performed with low concentrations of specific binding site in a background of competitor DNA, and binding should desirably take place in solution to avoid local concentration effects and the avidity of multivalent phage for ligands immobilised on solid surfaces.

As a safeguard against spurious selections, the specificity of individual phage should be determined following the final round of selection. Instead of testing for binding to a small number of binding sites, it would be desirable to screen all possible DNA sequences.

It has now been shown possible by the present inventors (below) to design a truly modular zinc binding polypeptide, wherein the zinc binding motif of each zinc binding finger is selected on the basis of its affinity for a particular triplet. Accordingly, it should be well within the capability of one of normal skill in the art to design a zinc finger polypeptide capable of binding to any desired target DNA sequence simply by considering the sequence of triplets present in the target DNA and combining in the appropriate order zinc fingers comprising zinc finger binding motifs having the necessary binding characteristics to bind thereto. The greater the length of known sequence of the target DNA, the greater the number of zinc finger binding motifs that can be included in the zinc finger polypeptide. For example, if the known sequence is only 9 bases long then three zinc finger binding motifs can be included in the polypeptide. If the known sequence is 27 bases long then, in theory, up to nine binding motifs could be included in the polypeptide. The longer the target DNA sequence, the lower the probability of its occurrence in any given portion of DNA.

Moreover, those motifs selected for inclusion in the polypeptide could be artificially modified (e.g. by directed mutagenesis) in order to optimise further their binding characteristics. Alternatively (or additionally) the length and sequence of the linker peptide between adjacent zinc binding fingers could be varied, as outlined above. This may have the effect of altering the position of the zinc finger binding motif relative to the DNA sequence of interest, and thereby exert a further influence on binding characteristics.

Generally, it will be preferred to select those motifs having high affinity and high specificity for the target triplet.

In a further aspect, the invention provides a kit for making a zinc finger polypeptide for binding to a nucleic acid sequence of interest, comprising: a library of DNA sequences encoding zinc finger binding motifs of known binding characteristics in a form suitable for cloning into a vector; a vector molecule suitable for accepting one or more sequences from the library; and instructions for use.

Preferably the vector is capable of directing the expression of the cloned sequences as a single zinc finger polypeptide. In particular it is preferred that the vector is capable of directing the expression of the cloned sequences as a single zinc finger polypeptide displayed on the surface of a viral particle, typically of the sort of viral display particle which are known to those skilled in the art. The DNA sequences are preferably in such a form that the expressed polypeptides are capable of self-assembling into a number of zinc finger polypeptides.

It will be apparent that the kit defined above will be of particular use in designing a zinc finger polypeptide comprising a plurality of zinc finger binding motifs, the binding characteristics of which are already known. In another
aspect the invention provides a kit for use when zinc finger binding motifs with suitable binding characteristics have not yet been identified, such that the invention provides a kit for [...] in accordance with the first aspect of the invention. Conveniently, the kit may also comprise a library of 64 DNA sequences, each sequence comprising a different one of the 64 possible permutations of three DNA bases, in a form suitable for use in the selection method defined previously. Typically, the 64 sequences are present in 12 separate mini-libraries, each mini-library having one position in the relevant triplet fixed and two positions randomised. Preferably, the kit will also comprise appropriate buffer solutions, and/or reagents for use in the detection of bound zinc fingers. The kit may also usefully include a vector suitable for accepting one or more sequences selected from the library of DNA sequences encoding zinc finger binding motifs.

In a preferred embodiment, the present teaching will be used for isolating the genes for the middle zinc fingers which, having been previously selected by one of the 64 triplets, are thought to have specific DNA binding activity. The mixture of genes specifying fingers which bind to a given triplet will be amplified by PCR using three sets of primers. The sets will have unique restriction sites, which will define the assembly of zinc fingers into three finger polypeptides. The appropriate reagents are preferably provided in kit form.

For instance, the first set of primers might have SfiI and AgeI sites, the second set AgeI and EagI sites and the third set EagI and NotI sites. It will be noted that the "first" site will preferably be SfiI, and the "last" site NotI, so as to facilitate cloning into the SfiI and NotI sites of the phage vector. To assemble a library of three finger proteins which recognise the sequence AAAGGGGGG, the fingers selected by the triplet GGG are amplified using the first two sets of primers and ligated to the fingers selected by the triplet AAA amplified using the third set of primers. The combinatorial library is cloned on the surface of phage and a nine base-pair site can be used to select the best combination of fingers en bloc. [...] three-finger proteins can be sold as components of a kit for [...] proteins could be built.

Additionally a large (pre-assembled) library of all combinations of the fingers selected by all triplets can also be developed for single-step selection of DNA-binding proteins using 9 bp, or much longer, DNA fragments. For this [...] methods of selection other than phage display; for example [...] linked [...]

[...] region of the gene of interest, designing a zinc finger [...] present in the [...] said zinc finger polypeptide [...] polypeptide is designed by one or both of the methods of the invention defined above.

Binding of the zinc finger polypeptide to the target sequence may result in increased or reduced expression of the gene of interest depending, for example, on the nature of the target sequence (e.g. structural or regulatory) to which the polypeptide binds.

In addition, the zinc finger polypeptide may advantageously comprise functional domains from other proteins (e.g. catalytic domains from restriction enzymes, recombinases, replicases, integrases and the like) or even "synthetic" effector domains. The polypeptide may also comprise activation or processing signals, such as nuclear localisation signals. These are of particular usefulness in targetting the polypeptide to the nucleus of the cell in order to enhance the binding of the polypeptide to an intranuclear target (such as genomic DNA). A particular example of such a localisation signal is that from the large T antigen of SV40. Such other functional domains/signals and the like are conveniently present as a fusion with the zinc finger polypeptide. Other desirable fusion partners comprise immunoglobulins or fragments thereof (e.g. Fab, scFv) having binding activity.

The zinc finger polypeptide may be synthesised in situ in the cell as a result of delivery to the cell of DNA directing expression of the polypeptide. Methods of facilitating delivery of DNA are well-known to those skilled in the art and include, for example, recombinant viral vectors (e.g. retroviruses, adenoviruses), liposomes and the like. Alternatively, the zinc finger polypeptide could be made outside the cell and then delivered thereto. Delivery could be facilitated by incorporating the polypeptide into liposomes etc. or by attaching the polypeptide to a targeting moiety (such as the binding portion of an antibody or hormone molecule). Indeed, one significant advantage of zinc finger [...] (example 3 below) to a single-chain (sc) Fv [...] to mouse cells via the [...] transferrin receptor. [...] wells, resins etc.) coated with NIP [...] for zinc fingers fused to [...] immobilised zinc [...]

In a particular embodiment, the invention provides a [...] a cell of a zinc finger
polypeptide which inhibits the expression of a gene enabling
the cancer cells to divide. The target could be, for example,
an oncogene or a normal gene which is overexpressed in the 
cancer cells. 

To the best knowledge of the inventors, design of a zinc 
finger polypeptide and its successful use in modulation of 
gene expression (as described below) has never previously 
been demonstrated. This breakthrough presents numerous 
possibilities. In particular, zinc finger polypeptides could be 
designed for therapeutic and/or prophylactic use in regulat- 
ing the expression of disease-associated genes. For example, 
zinc finger polypeptides could be used to inhibit the expression
of foreign genes (e.g. the genes of bacterial or viral
pathogens) in man or animals, or to modify the expression 
of mutated host genes (such 

The invention therefore provides a zinc finger polypeptide 
capable of inhibiting the expression of a disease-associated 
gene. Typically the zinc finger polypeptide will not be a 
naturally-occurring polypeptide but will be specifically 
designed to inhibit the expression of the disease-associated
gene. Conveniently the polypeptide will be designed by one 
or both of the methods of the invention defined above. 
Advantageously the disease-associated gene will be an 
oncogene, typically the BCR-ABL fusion oncogene or a ras 
oncogene. In a particular embodiment the invention pro- 
vides a zinc finger polypeptide designed to bind to the DNA 
sequence GCAGAAGCC and capable of inhibiting the
expression of the BCR-ABL fusion oncogene. 
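The modular, triplet-by-triplet view of a target site described above can be sketched in a few lines of Python. The example below splits the nine-base target GCAGAAGCC into the three triplets that individual zinc fingers would each need to recognise. The function name `split_into_triplets` is hypothetical, and the sketch deliberately says nothing about which finger of the polypeptide is paired with which triplet.

```python
def split_into_triplets(target):
    """Split a DNA target sequence into consecutive 3-base triplets."""
    target = target.upper()
    if set(target) - set("ACGT"):
        raise ValueError("target may contain only the bases A, C, G and T")
    if len(target) % 3 != 0:
        raise ValueError("target length must be a multiple of three bases")
    return [target[i:i + 3] for i in range(0, len(target), 3)]


triplets = split_into_triplets("GCAGAAGCC")
print(triplets)                     # one triplet per zinc finger binding motif
print(len(triplets), "zinc finger binding motifs needed")
```

A 27-base target handled the same way yields nine triplets, matching the statement that up to nine binding motifs could in theory be included for a known sequence of that length.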

In yet another aspect the invention provides a method of 
modifying a nucleic acid sequence of interest present in a 
sample mixture by binding thereto a zinc finger polypeptide, 
comprising contacting the sample mixture with a zinc finger 
polypeptide having affinity for at least a portion of the 
sequence of interest, so as to allow the zinc finger polypep- 
tide to bind specifically to the sequence of interest. 

The term "modifying" as used herein is intended to mean 
that the sequence is considered modified simply by the 
binding of the zinc finger polypeptide. It is not intended to 
suggest that the sequence of nucleotides is changed, 
although such changes (and others) could ensue following 
binding of the zinc finger polypeptide to the nucleic acid of 
interest. Conveniently the nucleic acid sequence is DNA. 

Modification of the nucleic acid of interest (in the sense 
of binding thereto by a zinc finger polypeptide) could be
detected in any of a number of methods (e.g. gel mobility
shift assays, use of labelled zinc finger polypeptides; labels
could include radioactive, fluorescent, enzyme or biotin/
streptavidin labels).

Modification of the nucleic acid sequence of interest (and 
detection thereof) may be all that is required (e.g. in diag- 
nosis of disease). Desirably however, further processing of 
the sample is performed. Conveniently the zinc finger 
polypeptide (and nucleic acid sequences specifically bound 
thereto) are separated from the rest of the sample.
Advantageously the zinc finger polypeptide is bound to a solid
phase support, to facilitate such separation. For example, the
zinc finger polypeptide may be present in an acrylamide or
agarose gel matrix or, more preferably, is immobilised on the
surface of a membrane or in the wells of a microtitre plate.

Possible uses of suitably designed zinc finger polypep- 
tides are: 

a) Therapy (e.g. targeting to double stranded DNA)

b) Diagnosis (e.g. detecting mutations in gene sequences: 
the present work has shown that "tailor made" zinc finger 
polypeptides can distinguish DNA sequences differing by 
one base pair). 
c) DNA purification (the zinc finger polypeptide could be
used to purify restriction fragments from solution, or to
visualise DNA fragments on a gel [for example, where the
polypeptide is linked to an appropriate fusion partner, or is
detected by probing with an antibody]).

In addition, zinc finger polypeptides could even be tar- 
geted to other nucleic acids such as ss or ds RNA (e.g. 
self-complementary RNA such as is present in many RNA 
molecules) or to RNA-DNA hybrids, which would present 
another possible mechanism of affecting cellular events at
the molecular level. 

In Example 1 the inventors describe and successfully 
demonstrate the use of the phage display technique to 
construct and screen a random zinc finger binding motif 
library, using a defined oligonucleotide target sequence.
In Example 2 is disclosed the analysis of zinc finger 
binding motif sequences selected by the screening procedure 
of Example 1, the DNA-specificity of the motifs being 
studied by binding to a mini-library of randomised DNA 
target sequences to reveal a pattern of acceptable bases at
each position in the target triplet — a "binding site signature". 
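To make the idea of a binding site signature concrete, the Python sketch below builds the 12 mini-libraries mentioned earlier (each with one position of the triplet fixed and the other two randomised) and then derives, from a set of mini-libraries to which a motif is observed to bind, the acceptable bases at each triplet position. The mini-library names such as "GNN", the function names, and the example binding result are all hypothetical and are used only for illustration; they are not data from the Examples.

```python
from itertools import product

BASES = "ACGT"


def mini_libraries():
    """Map each of the 12 mini-library names to the 16 triplets it contains."""
    libraries = {}
    for position in range(3):
        for base in BASES:
            # "N" marks a randomised position, e.g. "GNN" fixes G at position 1.
            name = "".join(base if i == position else "N" for i in range(3))
            libraries[name] = [
                "".join(t) for t in product(BASES, repeat=3) if t[position] == base
            ]
    return libraries


def binding_site_signature(bound_libraries):
    """List the acceptable bases at each triplet position.

    bound_libraries is the set of mini-library names that gave binding.
    """
    return [
        sorted(name[pos] for name in bound_libraries if name[pos] != "N")
        for pos in range(3)
    ]


libs = mini_libraries()
print(len(libs), "mini-libraries of", len(libs["GNN"]), "triplets each")

# Hypothetical result: a motif that binds four of the twelve mini-libraries.
observed = {"GNN", "NAN", "NCN", "NNG"}
print(binding_site_signature(observed))
```

In this hypothetical case the signature reads G at the first position, A or C at the second and G at the third, so the motif would be expected to accept the triplets GAG and GCG.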

In Example 3, the findings of the first two sections are 
used to select and modify rationally a zinc finger binding 
polypeptide in order to bind to a particular DNA target with
high affinity: it is convincingly shown that the peptide binds 
to the target sequence and can modify gene expression in 
cells cultured in vitro. 
Example 4 describes the development of an alternative 
zinc finger binding motif library.

Example 5 describes the design of a zinc finger binding 
polypeptide which binds to a DNA sequence of special 
clinical significance. 

BRIEF DESCRIPTION OF THE DRAWINGS 


The invention will now be further described by way of 
example and with reference to the accompanying drawings, 
of which: 

FIG. 1 is a schematic representation of affinity purification 
of phage particles displaying zinc finger binding motifs
fused to phage coat proteins; 

FIG. 2 shows three zinc fingers (Seq ID No. 2) used in the 
phage display library; 

FIG. 3 shows the DNA sequences of three oligonucleotides
(Seq ID Nos. 3-8) used in the affinity purification of
phage display particles;

FIG. 4 is a "checker board" of binding site signatures 
determined for various zinc finger binding motifs (Seq ID 
Nos. 19-51); 

FIGS. 5A-5F show graphs of fractional saturation against
concentration of DNA (nM) for various binding motifs and
target DNA triplets;

FIG. 6 shows the nucleotide sequence of the fusion
between BCR and ABL sequences in p190 cDNA (Seq ID
No. 9) and the corresponding exon boundaries in the BCR
and ABL genes (Seq ID Nos. 10-11);

FIG. 7 shows the amino acid sequences of various zinc 
finger binding motifs (Seq ID Nos. 12-17) designed to test 
for binding to the BCR/ABL fusion;

FIG. 8 is a graph of peptide binding (as measured by
A450-460 nm) against DNA concentration (μM) of target or
control DNA sequences;

FIG. 9 is a graph showing percentage viability against 
time for various transfected cells;

FIGS. 10A-10C and 11 illustrate schematically different 
methods of designing zinc finger binding polypeptides; and 
[...] a polypeptide (Seq ID No. 18) designed to bind to a particular DNA sequence (a ras oncogene).

EXAMPLE 1

In this example the inventors have used a screening technique to study sequence-specific DNA recognition by zinc finger binding motifs. The example describes how a library of zinc finger binding motifs displayed on the surface of bacteriophage enables selection of fingers capable of binding to given DNA triplets. The amino acid sequences of selected fingers which bind the same triplet were compared to examine how sequence-specific DNA recognition occurs. The results can be rationalised in terms of coded interactions between zinc fingers and DNA, involving base contacts from a few α-helical positions.

An alternative to the rational but biased design of proteins with new specificities is the isolation of desirable mutants from a large pool. A powerful method of selecting such proteins is the cloning of peptides (Smith 1985 Science 228, 1315-1317) or protein domains (McCafferty et al., 1990 Nature (London) 348, 552-554; Bass et al., 1990 Proteins 8, 309-314) as fusions to the minor coat protein (pIII) of bacteriophage fd, which leads to their expression on the tip of the capsid. Phage displaying the peptides of interest can then be affinity purified and amplified for use in further rounds of selection and for DNA sequencing of the cloned gene. The inventors applied this technology to the study of zinc finger-DNA interactions after demonstrating that functional zinc finger proteins can be displayed on the surface of fd phage, and that the engineered phage can be captured on a solid support coated with specific DNA. A phage display library was created comprising variants of the middle finger from the DNA binding domain of Zif268 (a mouse transcription factor containing 3 zinc fingers, Christy et al., 1988). DNA of fixed sequence was used to purify phage from this library over several rounds of selection, returning a number of different but related zinc fingers which bind the given DNA. By comparing similarities in the amino acid sequences of functionally equivalent fingers we deduce the likely mode of interaction of these fingers with DNA. Remarkably, it would appear that many base contacts can occur from three primary positions on the α-helix of the zinc finger, correlating (in hindsight) with the implications of the crystal structure of Zif268 bound to DNA (Pavletich & Pabo 1991). The ability to select or design zinc fingers with desired specificity means that DNA binding proteins containing zinc fingers can now be "made-to-measure".

Materials and Methods

Construction and cloning of genes. The gene for the first three fingers (residues 3-101) of Transcription Factor IIIA (TFIIIA) was amplified by PCR from the cDNA clone of TFIIIA using forward and backward primers which contain restriction sites for NotI and SfiI respectively. The gene for the Zif268 fingers (residues 333-420) was assembled from 8 overlapping synthetic oligonucleotides giving [...] enzyme SfiI (underlined) have been [...] by site-directed mutagenesis using the oligonucleotide (Seq ID No. 1):

cTCCTGCAGTTGGACCTGTGCCATGi2££G_
ntTI'GGGC CGCAlAGAAl'GGAACAACrAAAGC

which anneals in the region of the polylinker (L. Jespers, personal communication). Electrocompetent DH5α cells were transformed with recombinant vector in 200 ng aliquots, grown for 1 hour in 2xTY medium with 1% glucose, and plated on TYE containing 15 μg/ml tetracycline and 1% glucose.

FIG. 2 shows the amino acid sequence (Seq ID No. 2) of the three zinc fingers from Zif268 used in the phage display library. The top and bottom rows represent the sequence of the first and third fingers respectively. The middle row represents the sequence of the middle finger. The randomised positions in the α-helix of the middle finger have residues marked 'X'. The amino acid positions are numbered relative to the first helical residue (position 1). For amino acids at positions -1 to +8, excluding the conserved Leu and His, codons are equal mixtures of (G,A,C)NN: T in the first base position is omitted in order to avoid stop codons, but this has the unfortunate effect that the codons for Trp, Phe, Tyr and Cys are not represented. Position +9 is specified by the codon A(G,A)G, allowing either Arg or Lys. Residues of the hydrophobic core are circled, whereas the zinc ligands are written as white letters on black circles. The positions forming the β-sheets and the α-helix of the zinc fingers are marked below the sequence.

Phage Selection

Colonies were transferred from plates to 200 ml 2xTY/Zn/Tet (2xTY containing 50 μM Zn(CH3.COO)2 and 15 μg/ml tetracycline) and grown overnight. Phage were purified from the culture supernatant by two rounds of precipitation using 0.2 volumes of 20% PEG/2.5 M NaCl containing 50 μM Zn(CH3.COO)2, and resuspended in zinc finger phage buffer (20 mM HEPES pH 7.5, 50 mM NaCl, 1 mM MgCl2 and 50 μM Zn(CH3.COO)2). Streptavidin-coated paramagnetic beads (Dynal) were washed in zinc finger phage buffer and blocked for 1 hour at room temperature with the same buffer made up to 6% in fat-free dried milk (Marvel). Selection of phage was over three rounds: in the first round, beads (1 mg) were saturated with biotinylated oligonucleotide (~80 nM) and then washed prior to phage binding, but in the second and third rounds 1.7 nM oligonucleotide and 5 μg poly dGC (Sigma) were added to the beads with the phage. Binding reactions (1.5 ml) for 1 hour at 15° C. were in zinc finger phage buffer made up to 2% in fat-free dried milk (Marvel) [...] times with 1 ml of [...] buffer. Phage were eluted by shaking in 0.1 M triethylamine for 5 min and neutralised with an equal volume of [...] and TG1 in 2xTY [...] were

SI ^&^?S5£^^^ " ed * ^ -» diluti0DS of the iDfec,ed bacte " 

to end ligation using 3 short complementary hnkers, and na_ procedure, based on affinity 

amplified by PCR from the s.ngle strand usin^ ; ^2 wd ?™J™** ^ m no. I: zinc 

backward primers which contained sites for Nod and Sfil P™^?> ^ on , he sttrfacc of fd pb age(B) as 

respectively. Backward PCR primers in . addition imroduced io (Q m third MgM is 

Met-Ala-Glu as the first three amino acids oi hta ^zmc ^finger has ons to me v 2dc finger phage are 

peptides.andme^werefollowedbythete^^^^ ^^^^^^yhted DNA oligonucleotide [D ] attached 

p^^^rrK^o^X MalTaf^y 1. lot. Chel 267, 16007-16105). 
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fingers of Zi£268^ndUu;zmc b K & . seque nce 

SBSssss'sfflas'. w ■■»•• 
sss aasas."a s sa — - 

Corp.). 
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..-I „f 'X Wer DNA-Binding Domains from 
J&f *^ra68 ftteto the construclon of a phage 

P1 T ? Fd S SfS pSS Ly expe« the inventors 
vector Fd-let-fciN. inp^ ' ' N _ termma l fingers 

fused to the I- MoL 

Phage *^^i^3S£Kk ** 35 
binding to specific DNA 0 "8° n " c ' f Ucd m both 

zinc fingers were ^^ d ^aTe^5* specific oligo- 
Zif268 were diluted l:1.7xiu wuu _ r t u 



so that relative to the first residue in the a-heUx (position 

occurring residues at that position. 
The logic of this protocol, based upon the Zif268 crystal 

X "oSSwWA contacts is fixed by its two neigb- 

the randomised finger arc the most ""F*"^ 9 ™* 
specific complexes with DNA of known *qu nce^Smce 

binding (Jacobs 1992). 

The size of the phage display library required, assuming full degeneracy of the 8 variable positions, is (16 [illegible]), but because of practical limitations in the efficiency of [illegible] times smaller than the theoretical [illegible] zinc finger motif.
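As a hedged illustration of where a theoretical library size of this kind comes from, the sketch below enumerates the two codon mixtures described for the library, (G,A,C)NN and A(G,A)G, against the standard genetic code. It assumes that seven of the eight variable positions use (G,A,C)NN and that position +9 uses A(G,A)G. The function name `encoded_residues` is illustrative, and the resulting size is our own arithmetic rather than a figure legible in the text.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def encoded_residues(first, second, third):
    """Return the set of residues (or '*') encoded by a mixed-base codon."""
    return {CODON_TABLE[a + b + c] for a in first for b in second for c in third}


# (G,A,C)NN: T is omitted from the first base position.
randomised = encoded_residues("GAC", BASES, BASES)
# A(G,A)G, the codon described for position +9.
position_9 = encoded_residues("A", "GA", "G")

# Residues that the (G,A,C)NN mixture cannot encode.
missing = (set(AMINO_ACIDS) - {"*"}) - randomised

# Assumption: seven positions use (G,A,C)NN and one position uses A(G,A)G.
theoretical_size = len(randomised) ** 7 * len(position_9)

print(sorted(randomised))
print(sorted(missing))
print(sorted(position_9))
print(theoretical_size)
```

Under these assumptions the mixture encodes 16 residues with no stop codon, the four unrepresented residues are Cys, Phe, Trp and Tyr in one-letter code, and position +9 is restricted to Lys or Arg.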

SSSSaS SraVphie seated after 3 rounds 
filing with variants of the Zif268 operator. 



zm6S were diluted " '^.^ inc , lbated with beads 
bearing Tine «^«^SonL to three of the total 45 _ 

coated with ZlQ f°^ZdZa E coli were shown by 
phage eluted and transfected into i. c ott w 
Uny to , carr, £ Bfi» ^ 
enrichment factor ot over _w ffl . ^ 

Hence it is clear that zinc ^fingers displayed ^VJ 

sirs K»f) 



TABLE 1 



CAG 



TGA 



GAA 



GAT 



1 
9 

3 
1 
(3) 




GAC 





jpGSLTSHRK 

£ I § 

IggSlvShlr 
|gg|lg8hmk 
|rsSll|htr 
&qs$ly§hqr 

lASttLliHQR 

III 

|rs$le$htr 




Sal. 
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GCC 



GTC 



GCA 



GCT 




IrsIli^ktr 
|rt|ls|hir 

|a&|lv| hsk 

$GG§LA&HER 



ACG 



ATG 



GTA 



(1) 
(2) 



(1) 




10 
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TTG 



CCG 



GCG 



GTG 



|rt|ls|hir 
$arSlt$hqr 

$GGSLA$I 

SraSlaShmr 

|rg|la|her 

|gd|lt|her 
|ad|lm|hkr 

£ $ & 
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Ed to contact the DNA. For f^£j°**£g& 
from the 8 different clones selected with the triplet [illegible] all have Asn [illegible] as does the first zinc finger of the Drosophila protein tramtrack, in which they are seen making [illegible] same triplet in the cocrystal with specific DNA (Fairall et al., 1993). This indicates that the [illegible] particular amino acid in functionally equivalent fingers is unlikely to be coincidental, but rather because it has a functional role. Thus using data collected from the phage display library (Table 1) it is possible to infer most of the

[illegible] the results can be rationalised in terms of contacts from the primary α-helical positions (-1, +3 and +6) [illegible] by X-ray crystallography (Pavletich & Pabo 1991) and database analysis (Jacobs 1992).

[illegible] 1991 Science 252, 809-817) and tramtrack (Fairall et al., 1993). In these structures, and in almost all of the selected fingers in which Arg recognises G at the 3' end, Asp occurs at position +2 to buttress the long Arg side chain (e.g. Table 1o, p). When position -1 is not Arg, Asp rarely [illegible], suggesting that in this case any other contacts it might make with the second DNA strand do not contribute significantly to the stability of the protein-DNA complex.

Adenine is also an important determinant of sequence specificity, recognised almost exclusively by Asn or Gln [illegible] to make bidentate contacts (Seeman et al., 1976). When A is present at the 3' end of a triplet, Gln [illegible] position -1 of the

-rasas »sft- - -r„; t* — r * 

7n h rSS rnSLl. In addition to the DNA triplets shown J*^^^ bonding amino acid side chain in 

here oE weT also used in attempts to select zinc finpr ^^J^ e t al., 1976). Nevertheless, C m 

Setom the library, but most selected lm> clones one toe W ;^^ ani ^prf~l^ 

£ing the a-helical sequence KASNLVSHIR, (SeqID Na the 3 p , . Table 

119) and the other having the sequence "JJ^™ leg) Asp is also sometimes selected at +3 and +6 when C 

ccr, err, rrc, agt, cga, cai, aga, aoc ana yw. 



i 

3 
1 
3 
1 

1 
1 
1 
1 
1 
2 
1 




35 



40 



|ld$lr|hlk
$gd&lt§her

respectively. Although Asp can accept a hydrogen bond from the amino group of C, one should note that the positive molecular charge of C in the major groove (Hunter 1993 J. Mol. Biol. 230, 1025-1054) will favour an interaction with Asp regardless of hydrogen bonding contacts.

However, C in the middle position most frequently selects Thr (e.g. Table 1[illegible]), Val or Leu (e.g. Table 1o) at +3. Similarly, T in the middle position most often selects Ser (e.g. Table 1i), Ala or Val (e.g. Table 1p) at +3. The aliphatic amino acids are unable to make hydrogen bonds, but Ala probably has a hydrophobic interaction with the methyl group of T, whereas a longer side chain such as Leu can exclude T and pack against the ring of C. When T is at the 5' end of a triplet, Ser and Thr are selected at +6 (as is occasionally the case for G at the 5' end). Thymine at the 3' end of a triplet selects a variety of polar amino acids at -1 (e.g. Table 1d) and occasionally returns fingers with Ser at +2 (e.g. Table 1a), which could make a contact as seen in the tramtrack crystal structure (Fairall et al., 1993).

Limitations of Phage Display

From Table 1 it can be seen that a consensus or bias usually occurs in two of the three primary positions (-1, +3 and +6) [illegible] in many cases phage selection is by virtue of only two base contacts per finger, as is observed in the Zif268 crystal structure (Pavletich & Pabo 1991). Accordingly, identical sequences are often returned by DNA sequences differing by one base in the central triplet. One reason for this is that the phage display selection, being essentially [illegible] by affinity, can yield zinc fingers which bind [illegible] to a number of DNA triplets and so are unable to discriminate. Secondly, since complex formation is governed by the law of mass action, affinity selection can favour those clones whose representation in the library is greatest, even though their true affinity for DNA is less than that of other clones less abundant in the library. Phage display selection by affinity is therefore of limited value in distinguishing between permissive and specific interactions beyond those base contacts necessary to stabilise the complex. In the absence of competition from fingers which are able to bind specifically to a given DNA, the tightest non-specific complexes will be selected from the phage library. Consequently, results obtained by phage display selection from a library must be confirmed by specificity assays, particularly when that library is of limited size.

Conclusion

The amino acid sequence biases observed within a family of functionally equivalent zinc fingers indicate that, of the α-helical positions randomised in this study, only three primary (-1, +3 and +6) and one auxiliary (+2) positions are involved in recognition of DNA. Moreover, a limited set of amino acids are to be found at those positions, and it is presumed that these make contacts to bases. The indications therefore are that a code can be derived to describe zinc finger-DNA interactions. At this stage however, although sequence homologies are strongly suggestive of amino acid preferences for particular base-pairs, one cannot confidently deduce such rules until the specificity of individual fingers for DNA triplets is confirmed. The inventors therefore defer making a summary table of these preferences until the following example, in which is described how randomised DNA binding sites can be used to this end.

While this work was in progress, a paper by Rebar and Pabo was published (Rebar & Pabo 1994 Science 263, 671-673) in which phage display was also used to select zinc fingers with new DNA-binding specificities. These authors constructed a library in which the first finger of Zif268 is randomised and screened with tetranucleotides to take into account end effects such as additional contacts from variants of this finger. Only 4 positions (-1, +2, +3 and +6) were randomised, chosen on the basis of the earlier X-ray crystal structures. The results presented above, in which more positions were randomised, to some extent justify Rebar and Pabo's use of the four random positions without apparent loss of effect, although further selections may reveal that the library is compromised. However, randomising only four positions decreases the theoretical library size so that full degeneracy can be achieved in practice. Nevertheless the inventors found that the results obtained by Rebar and Pabo by screening their complete library with two variant Zif268 operators are in agreement with their conclusions derived from an incomplete library. On the one hand this again highlights the versatility of zinc fingers but, remarkably, so far both studies have been unable to produce fingers which bind to the sequence CCT. It will be interesting to see whether sequence biases such as we have detected would be revealed if more selections were performed using Rebar and Pabo's library. In any case, it would be desirable to investigate the effects on selections of using different numbers of randomised positions in more complete libraries than have been used so far.

The original position or context of the randomised finger in the phage display library might bear on the efficacy of selected fingers when incorporated into a new DNA-binding domain. Selections from a library of the outer fingers of a three finger peptide (Rebar & Pabo, 1994 Science 263, 671-673; Jamieson et al., 1994 Biochemistry 33, 5689-5695) are capable of producing fingers which bind DNA in various different modes, while selections from a library of the middle finger should produce motifs which are more constrained. Accordingly Rebar and Pabo do not assume that the first finger of Zif268 will always bind a triplet, and screened with a tetranucleotide binding site to allow for different binding modes. Thus motifs selected from libraries of the outer fingers might prove less amenable to the assembly of multifinger proteins, since binding of these fingers could be perturbed on constraining them to a particular binding mode, as would be the case [illegible] had to occupy the middle position of an assembled three-finger protein. In contrast, motifs selected from libraries of the middle finger, having been originally constrained, will presumably be able to preserve their mode of binding even when placed in the outer positions of an assembled DNA-binding domain.

FIGS. 10A-10C show different strategies for the design of tailored zinc finger proteins. (A) A three-finger DNA-binding motif is selected en bloc from a library of three randomised fingers. (B) A three-finger DNA-binding motif is assembled out of independently selected fingers from a library of one randomised finger (e.g. the middle finger of Zif268). (C) A three-finger DNA-binding motif is assembled out of independently selected fingers from three positionally specified libraries of randomised zinc fingers.

FIG. 11 illustrates the strategy of combinatorial assembly followed by en bloc selection. Groups of triplet-specific zinc fingers (A) isolated by phage display selection are assembled in random combinations and re-displayed on phage (B). A full-length target site (C) is used to select en bloc the most favourable combination of fingers (D).

EXAMPLE 2

This example describes a new technique to deal efficiently with the selection of a DNA binding site for a given zinc finger (essentially the converse of example 1). This is desirable as a safeguard against spurious selections based on the screening of display libraries. This may be done by screening against libraries of DNA triplet binding sites randomised in two positions but having one base fixed in the third position. The technique is applied here to determine the specificity of fingers previously selected by phage display. The inventors found that some of these fingers are able to specify a unique base in each position of the cognate triplet. This is further illustrated by examples of fingers which can discriminate between closely related triplets as measured by their respective equilibrium dissociation constants. Comparing the amino acid sequences of fingers which specify a [illegible] achieved using a small set of amino acid [illegible] amenable to a code.

One can determine the optimal binding sites of these (and other) proteins by selection from libraries of randomised DNA. This approach, the principle of which is essentially the converse of zinc finger phage display, would provide an equally informative database from which the same rules can be independently deduced. However until now, the favoured method for binding site determination (involving iterative selection and amplification of target DNA followed by sequencing) has been a laborious process not conveniently applicable to the analysis of a large database (Thiesen & Bach 1990 Nucleic Acids Res. 18, 3203-3209; Pollock & Treisman 1990 Nucleic Acids Res. 18, 6197-6204).

This example presents a convenient and rapid new method which can reveal the optimal binding site(s) of a DNA binding protein by single step selection from small libraries, and use this to check the binding site preferences of those zinc fingers selected previously by phage display. For this application, the inventors have used 12 different mini-libraries of the Zif268 binding site, each one with the central triplet having one position defined with a particular base pair and the other two positions randomised. Each library therefore comprises 16 oligonucleotides and offers a number of potential binding sites to the middle finger, provided that the latter can tolerate the defined base pair. Each zinc finger phage is screened against all 12 libraries [illegible] immobilised in wells of a microtitre plate, and binding is detected by an enzyme immunoassay. Thus a pattern of acceptable bases at each position is disclosed, which the inventors term a "binding site signature". The information contained in a binding site signature encompasses the repertoire of binding sites recognised by a zinc finger.

The binding site signatures obtained, using zinc finger phage selected as described in example 1, reveal that the selection has yielded some highly sequence-specific zinc finger binding motifs which discriminate at all three positions of a triplet. From measurements of equilibrium dissociation constants it is found that these fingers bind tightly to the triplets indicated in their signatures, and discriminate against closely related sites (usually by at least a factor of ten). The binding site signatures allow progress towards a specificity code for the interactions of zinc fingers with DNA.

Materials and Methods

Binding Site Signatures

Flexible flat-bottomed 96-well microtitre plates (Falcon) were coated overnight at 4° C. with streptavidin (0.1 mg/ml in 0.1M NaHCO3 pH8.6, 0.03% NaN3). Wells were blocked [illegible] 2% [illegible] PBS/Zn containing 0.1% Tween, and another 3 times with PBS/Zn. The "bound" strand of each oligonucleotide [illegible] synthetically and the other strand extended [illegible] universal primer using DNA polymerase [illegible] fragment). [illegible] reactions were added to wells ([illegible] DNA library in each) in PBS/Zn for 15 [illegible] and again with PBS/Zn. Overnight bacterial [illegible] were [illegible] 2xTY containing 50 µM Zn(CH3.COO)2 and 15 [illegible] supernatants containing phage were diluted tenfold by the addition of PBS/Zn containing 2% fat-free dried milk (Marvel), [illegible] 20 µg/ml sonicated salmon sperm DNA. Diluted phage [illegible] PBS/Zn. Bound phage were detected as described previously (Griffiths et al., 1994 [illegible] HRP-conjugated anti-M13 IgG [illegible] quantitated using software package SOFTMAX 2.32 (Molecular Devices Corp).

[illegible] binding [illegible] finger phage. The figure [illegible] binding of zinc finger phage to randomised DNA [illegible] of microtitre plates. To test each [illegible] phage against each oligonucleotide library (see [illegible] (across the plate) contain [illegible] of a zinc finger phage. The [illegible] library [illegible] as the middle triplet of the [illegible] of Zif268 operator, where N represents a [illegible] of all 4 nucleotides. The zinc finger phage is [illegible] the sequence of the variable region of the [illegible] relative to the first helical residue [illegible] three primary recognition positions are [illegible] phage are detected by an enzyme immunoassay. [illegible] strength of binding is indicated by [illegible] proportional to the enzyme activity. From the [illegible] of binding to DNA libraries, called the "signature" of [illegible] number of binding sites can be [illegible] on the right of the figure.

[illegible] of Apparent Equilibrium Dissociation Constants

Overnight bacterial cultures were grown in 2xTY/Zn/Tet [illegible] containing phage were diluted [illegible] addition of PBS/Zn containing 4% fat-free dried milk (Marvel), 2% Tween and 40 µg/ml sonicated salmon sperm DNA. Binding reactions, containing appropriate concentrations of specific 5'-biotinylated DNA and [illegible] volumes of zinc finger phage solution, were allowed to equilibrate for 1 h at 20° C. All DNA was captured on streptavidin-coated paramagnetic beads (500 [illegible] per well) which were subsequently washed 6 times with PBS/Zn containing 1% Tween and then 3 times with PBS/Zn. Bound phage were detected using HRP-conjugated anti-M13 IgG (Pharmacia) and developed as described (Griffiths et al., 1994). [illegible] densities were quantitated using software SOFTMAX 2.32 (Molecular Devices Corp).

[illegible] results are shown in FIGS 5A-5F, which is a series of graphs of fractional saturation against concentration of DNA (nM). The two outer fingers carry the native sequence, [illegible] DNA triplets. The sequence of amino acids occupying helical positions -1 to +9 of the varied finger are shown in each case. The graphs show that [illegible] middle finger [illegible] apparent equilibrium dissociation constant, as below.
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Estimations of the K, are by fitting to the equation 
Kd = [DNA][P]/[DNA.P], using the software package
KALEIDAGRAPH™ Version 2.0 programme (Abelbeck
Software). Owing to the sensitivity of the ELISA used to
detect protein-DNA complex, the inventors were able to use
zinc finger phage concentrations far below those of the
DNA, as is required for accurate calculations of the Kd. The
technique used here has the advantage that while the concentration
of DNA (variable) must be known accurately, that
of the zinc fingers (constant) need not be known (Choo &
Klug 1993 Nucleic Acids Res. 21, 3341-3346). This circumvents
the problem of calculating the number of zinc
finger peptides expressed on the tip of each phage, although
since only 10-20% of the gene III protein (pIII) carries such
peptides one would expect on average less than one copy per
phage. Binding is performed in solution to prevent any
effects caused by the avidity (Marks et al., 1992) of phage
for DNA immobilised on a surface. Moreover, in this case
measurements of Kd by ELISA are made possible since
equilibrium is reached in solution prior to capture on the
solid phase.
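As a hedged illustration of the fitting step, the sketch below estimates Kd from fractional saturation data. It assumes, as stated above, that the zinc finger phage is far less concentrated than the DNA, so that free [DNA] is approximately the amount added; the equilibrium equation then rearranges to a fractional saturation of [DNA.P]/([P] + [DNA.P]) = [DNA]/(Kd + [DNA]). The function names, the search bounds and the example data are illustrative assumptions, and the simple grid refinement stands in for whatever fitting routine the original software used.

```python
import math


def fractional_saturation(dna_nM, kd_nM):
    """Fraction of protein bound when free [DNA] is approximately the total added."""
    return dna_nM / (kd_nM + dna_nM)


def fit_kd(dna_nM, saturation, lo=1e-3, hi=1e4, rounds=6, points=201):
    """Least-squares estimate of Kd (nM) by repeated refinement of a log-spaced grid."""

    def sum_squared_error(kd):
        return sum(
            (fractional_saturation(d, kd) - s) ** 2
            for d, s in zip(dna_nM, saturation)
        )

    log_lo, log_hi = math.log10(lo), math.log10(hi)
    best = lo
    for _ in range(rounds):
        step = (log_hi - log_lo) / (points - 1)
        grid = [10 ** (log_lo + i * step) for i in range(points)]
        best = min(grid, key=sum_squared_error)
        # Narrow the search window to one grid step either side of the best point.
        log_lo = math.log10(best) - step
        log_hi = math.log10(best) + step
    return best


# Synthetic, noise-free example data generated with an assumed Kd of 5 nM.
dna = [0.5, 1.0, 2.0, 5.0, 10.0, 20.0, 50.0]
observed = [fractional_saturation(d, 5.0) for d in dna]
estimated_kd = fit_kd(dna, observed)
print(round(estimated_kd, 3))
```

Only the DNA concentrations enter the fit, which mirrors the point made above that the zinc finger concentration need not be known. With real ELISA data the signal would first have to be normalised to its plateau value, a step this sketch does not cover.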

Results and Discussion 
The Binding Site Signature of the Second Finger of Zif268 
The top row of FIG. 4 shows the signature of the second 



acids at position -1 and these are listed as pairs. Ser or Thr
at position +6 permit Asp +2 of the following finger (denoted
Asp ++2) to specify both G and T indirectly, and the pairs
are listed. The specificity of Ser +3 for T and Thr +3 for C may
be interchangeable in rare instances, while Val +3 appears to
be consistently ambiguous.

TABLE 2

POSITION IN TRIPLET

        5'                  MIDDLE      3'

G       Arg +6              His +3      Arg -1/Asp +2
        Ser +6/Asp ++2
        Thr +6/Asp ++2

A                           Asn +3      Gln -1/Ala +2

T       Ser +6/Asp ++2      Ala +3      Asn -1
        Thr +6/Asp ++2      Ser +3      Gln -1/Ser +2
                            Val +3

C                           Asp +3      Asp -1
                            Leu +3
                            Thr +3
                            Val +3

respectively iu wo wajui siw/' - — ? * 

+i of finger 3 is dominant over the amino acid present at 
position +6 of the middle finger, precluding the possibility of 
recognition of A or C at the 5' position. Future libraries must 
be designed with this interaction omitted or the position 



The binding site signatures also reveal an important 

The top row of FIG. 4 shows the signature o, m secono 

finger of wild type * 3 eg 2£ rf Ae amino acid present at position +6, 

indicating binding to ohgonucleotide hbrane hW 0NN. paneU Q and T „ the 5 . end of a 

TON, NGN and NNG as the middle triplet, n emergent ar able to gh ^ for ^ fc ^ ^ 5 , 

the optimal binding sUe for th,s finger .s 170,0.0. ■ - accord £p £ P ^ . & ^ ^ ^ Q Qr by , 

with the publisher "^.^^^f ^STs * Sc. from the invariant Mp at position + 2 of finger 3 to 

1989 Proc. Natl. Acad. Sci. USA 86, 8737-8 ni )■ ims nas complementary strand, 

implications for the interpretation of the X-ray crystal struc- ^ partner . °^^ D4 * zif26g f Pavletich & Pabo 

mre of Zif268 solved in complex with operator to tg* (Fairall ct ^ 

having, TOG as the middle triplet <P«vlc tich J^Sd statutes (/contact to NH 2 of C or A 

ing an equivalent contact to be possible with N7 of A, but 
from the binding site signature we can see dial there is 
discrimination against A. This implies that the His may 

prefer to make a hydrogen bond to 06 of O or a bifiimated TV r^j^^i of the conserved 
hydrogen bond to both 06 and N7, or that a stenc clash wi h 40 yaned 1 ideQtify , ^ in the 

the «L> group of A may prevent a tight mteracto.tr wuh regmns of the fee ^™ ^ 
misbasc.Thusbyconsidermgtheste^ both G and T viz the occurrence of Ser or Thr at position +6, 

helicalDNA,bindingsitesignaturescangtvems.ghtmtothe hvd bond t0 either baS e. 

details of zinc finger-DNA interactions. Modulation of Base Recognition by Auxiliary Positions 

Amino Acid-baseContacUinZmcFrnger-DNAComplexes 45 ModmaUonotB gn^ ^ ^ ^ ^ ^ ^ 

Deduced from Binding Site Signatures rf j , and can thus work „, 

The binding site signatures of other zinc fing^vea dm^j 3 o^ he og p ^ ^ 

that the phage selections performed in example 1 y*lded ^™ ^ ^ t0 amm0 acid . b ase 

highly sequence-specific DNA bindmg P^.f * e °f 1 toSTee primary positions, indicate that 

these are able to specify a unique sequence for the midd e 5U con^cfc from me p ^ ^ ^ n 

triplet of a variant Zf268 bindmg site, and arc therefore au ™P™^y at position -1, which is specific 

more specific than is Zif268 itself for its consensus sU^ Aclear c« m po. nt . Ua ^ p ^ ^ ^ ^ 

Moreover, one can identify the ^Jfi™^ S^Lf amuS ^dU s2 as Ala, Lugh specific for T 
particular oligonucleotide library that is to say a i Specihc n ° n P° & at hion +2 . The 

base at a defined position, by looking down the columns of 5 5 when *£~S£Sl Alg at ^-1 and Asp at 
FIG. 4. By comparing the *™ SS^S^^«"^^^ ^X^y 

fingers one can identify any res.d^ .s ^^""^ ^ 1 structures of zinc fingers, is another instance of 
preferences for particular bases on bound DNA. With a tew cryswi b pSntions. Thus the amino acid 

exception these are as previous y prahcted on the has.s of ^ Site or enhance the specificity 

of the amino acid at other positions. 

At position +3, a different type of modulation is seen in 
the case of Thr and Val which most often prefer C m the 

for appropriate triplets. Cognate , amino acms „ j£££*tfc2ft ^SSSSS^SH 
positions m the a-hefix are entered m a matnxrelaung each « te recogms^ , bott ^ Mictions involving 

b p So^^^^ [te^^ofthJresmues.andhereafiexMitym 



eXCCpUUU5», llic^fc oiw w> ^ x — 

phage display, and are summarised in Table 2. 

Table 2 summarises frequently observed amino acid-base 
contacts in interactions of selected zinc fingers with DNA. 
The given contacts comprise a "syllabic" recognition code 
for appropriate triplets. Cognate amino acids and their 
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, . „ ,u .„ pffprt (mm Bern who have derived similar rules by altering zinc finger 

the inclination of the finger rather than an effe from ^KiSiigdata basc-guidcd mutagenesis (Desjarlais & 

another position per se may be the cause of ambiguous ^ ^ sd (* A g? 7M5 _ 7349 . 

reading. TVsiarlais & Dere 1993 Proc. Natl. Acad. Sci. USA 90, 

Quantitative Measurements of Dissociation Constants ?£S n ? 

^thi rL^aS £5 Utable at each z^fi^may i^^^S 

position of the triple,. Furthermore, because two base posi- tt» o 2Sy of^consistencies may be 

tions are randomly occupied m any one J brary of ataU. ^JgJ^J^ 4^ in ^ inclination of 

oligonucleotides, binding site signatures are no, fonnaUy accounted to ^cons, g ^ t0 thc 

able to exclude the possibility of context dependence for is ^^^^^S- It a^ralhat ftTidentity 

some interactions. Therefore to supplement binding stfe a .^%ioo is attuned to 

ean be isolated using a competitive selection V™**- ^X^Clw) An alphabetic code 

discrimination. Also, the ambiguity in certain binding site Uon o J ««* ^ » *J « " f u \ „ ms - 

■*T"**-™^5Jffl^S5S2^ U ™abic" code of interactions with DMA is made 

the ammo acid^quencc RGDALTSHER (Seq ID No. lUUj y ^ ^ framework 0 f ,he zinc finger, this 

that in the case of Zif268-like zinc fingers, DN A recognition ^^^^ lhe versatjHly of the finger motif 

[illegible] may require structural analyses for clarification. Thus, water may be seen to play an important role, for example where [illegible] side chains such as Asn or Ser interact with bases from position -1 (Qian et al., 1993 J. Am. Chem. Soc. 115, 1189-1190; Shakked et al., 1994 Nature (London) 368, [illegible]).

Eventually, it might be possible to develop a number of [illegible] zinc finger binding to DNA, which could predict the binding site preferences of some zinc fingers [illegible] acid sequence. [illegible] functional amino acids selected at positions -1, +3 and to an extent +6 in this study are very frequently observed at the same positions in naturally occurring fingers (e.g. see FIG. 4 of Desjarlais and Berg 1992 Proteins 12, 101-104), supporting the existence of [illegible] contacts from these three positions. However, the lack of definitive predictive methods is not a serious practical [illegible] techniques (here and in Thiesen & Bach 1990 and Pollock & Treisman 1990) will [illegible] the identification of binding sites for a given DNA-binding protein. Rather, one can apply phage selection and a knowledge of the recognition rules to the converse problem, namely the design of proteins to bind predetermined [illegible]

[illegible] the Design of DNA-binding Proteins

[illegible] to manipulate the sequence specificity of zinc fingers implies that we are on the eve of designing DNA-binding proteins with desired specificity [illegible] medicine and research (Desjarlais & Berg, 1993; Rebar & Pabo 1994). This is possible because, by contrast to all other DNA-binding motifs, we can avail ourselves of the modular nature of the zinc finger, since DNA sites can be recognised by appropriate combinations of independently acting fingers linked in tandem.

The coded interactions of zinc fingers with DNA can be used to model the specificity of individual zinc fingers de novo, or more likely in conjunction with phage display selection of suitable candidates. In this way, according to requirements, one could modulate the affinity for a given binding site or even engineer an appropriate degree of indiscrimination at particular base positions. Moreover, the additive effect of multiply repeated domains offers the opportunity to bind specifically and tightly to extended, and hence very rare, genomic loci. Thus zinc finger proteins might well be a good alternative to the use of antisense nucleic acids in suppressing or modifying the action of a given gene, whether normal or mutant. To this end, extra functions could be introduced to these DNA binding domains by appending suitable natural or synthetic effectors.

EXAMPLE 3

[illegible] resulting in blockage of transcription. [illegible] cells made growth factor-independent by the action of the oncogene (Daley et al., 1988 Proc. Natl. Acad. Sci. U.S.A. 85, 9312-9316) are found to revert [illegible] transient transfection with a vector expressing the designed zinc finger polypeptide.

DNA-binding proteins designed to recognise [illegible] DNA sequences could be incorporated in chimeric transcription factors, recombinases, nucleases etc. for a wide range of applications. The inventors have shown that zinc finger domains can discriminate between closely related DNA triplets, and have proposed that they can be linked together to form domains for the specific recognition of longer DNA sequences. One interesting possibility for the use of such protein domains is to [illegible] differences in pathogens or transformed cells. Here one such application is described.

There exist a set of human leukaemias in which a reciprocal [illegible] a truncated chromosome 22, the Philadelphia chromosome (Ph1), encoding at the breakpoint a fusion of sequences [illegible] c-ABL proto-oncogene [illegible] BCR gene (Groffen et al., 1984 Cell [illegible]). [illegible] chronic myelogenous leukaemia (CML) the [illegible] usually in the first intron of the c-ABL gene and in the breakpoint cluster region of the BCR gene (Shtivelman et al., 1985 Nature 315, 550-554), [illegible] to a p210 BCR-ABL gene product (Konopka et al., 1984 Cell 37, 1035-1042). Alternatively, in acute lymphoblastic leukaemia (ALL), the breakpoints usually occur in the first introns of both BCR and c-ABL (Hermans et al., 1987 Cell 51, 33-[illegible]), and result in a p190 BCR-ABL gene product (FIG. 6) (Kurzrock et al., 1987 Nature 325, 631-635).

FIG 6 shows the nucleotide sequences (Seq ID No.s 9-11) of the fusion point between BCR and ABL sequences in p190 cDNA, and of the corresponding exon boundaries in the BCR and c-ABL genes. Exon sequences are written in capital letters while introns are given in lowercase. Line 1 [illegible] cDNA; line 2 the BCR genomic [illegible] junction of exon [illegible] 1; and line 3 the ABL genomic sequence at junction of intron 1 and exon [illegible]. [illegible] bp in the p190 [illegible] target is underlined, as are the homologous [illegible] BCR and c-ABL.

[illegible] rearranged genes act as dominant [illegible] cell culture (Daley et al., 1988) [illegible] (Heisterkamp et al., 19[illegible] Nature 344, [illegible]). [illegible] their genomic counterparts, the cDNAs bear [illegible] unique nucleotide sequence at the fusion point [illegible] genes, which can be recognised at the DNA level [illegible] protein. [illegible] present invention [illegible] such a protein to recognise the unique [illegible] cDNA. [illegible] breakpoints in the spontaneous [illegible] which are thought to be variable [illegible] design of such peptides has implications for cancer research, the primary aim here is to [illegible] finger selected for binding to GCA. A phage [illegible] mini-library of putative BCR-ABL-binding three-[illegible] comprising six possible combinations of the six selected or designed fingers [illegible] appropriate order. [illegible] fingers are illustrated in FIG 7 (Seq ID No.s 12-17). [illegible] regions of secondary structure are underlined
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below the list while residue positions are given above, was capable of site-specific DNA-binding, in vivo. The 

fettL to tie tat position of the a-helix (position 1). Zinc peptide was fused to the VP16 action domau, from 

finger phages were selected from a library of 2.6x10 s herpes s.mplex vims (Fields 1993 ^^^nd 

variants using three DNAbinding sites each containing one used in transtent transaction assays (FIG. 9) to drive pro- 

o^S^QAAoiQ^J&^^mM^ 5 ^lion of a CAT (chloramphenicol accty transfer^) 

(exampl^) indicate that fingers 1A and IB specify the reporter gene from a binding sue "P^^^TATAbox 

Llct GCC, finger 2A specifics GAA, while the fingers (Gorman ct al., Mol. Cell. Biol. 2, 1044-1051). In detail he 

sefited using .he triplet GCA all prefer binding to OCT. experiment was performed thus: reporter plasm ds 

5h latter is finger 3A, the specificity of which we pMCAT6BA, pMCAT6A ^ PMCAl 6B we* cons ructed 

believS,onthebasisofrecognitionrules,couldbechanged by inserting 6 copies of the pl90» target site 

by a point mutation. Finger 3B, based on the selected finger (CGCAGAAGCC) (Seq ID No 121), the c-ABL second 

3A, but in which Gin at helical position +2 was altered to Ala exon-intron junction sequence (TCCAGAAGCC), or the 

should be specific for GCA Finger 3C is an alternative BCR first exon-intron junction sequence (CGCAGGTGAG) 

version of finger 3A in which the recognition of C is (Seq ID No. 123) respectively, into pMCAT3 (Luscher et al., 

mediated by Asp +3 rather than by Thr +3. 1939 Genes Dev. 1507-1517). The anti-BCR-ABL/VP16 

The mini library was screened once with an oligomiclc- expression vector was generated by inserting the in-frame 

otide containing the 9 base-pair BCR-ABL target sequence between the activation domain of herpes simplex 

to select for tight binding clones over weak binders and virus V p 16 (fields 1993) and the Zn finger peptide in the 

background vector phage. Because the library was small, the p EF-BOS vector (Mizushima & Shigezaku 1990 Nucl. 

inventors did not include competitor DNA sequences for Acids Res 18> 532 2). C3H10T1/2 cells were transiendy 

homologous regions of the genomic BCR and c-ABL genes 20 ^.^3^^^ ^th iq mg 0 f reporter plasmid and 10 mg of 

but instead checked the selected clones for their ability to cxpression vector . RSVL(dc Wet ct al., 1987 Mol. Cell Biol, 

discriminate. It was found that although all the selected 7 72 5_737) w hich contains the Rous sarcoma virus long 

clones were able to bind the BCR-ABL target sequence and ^ ^ * ^ lucifcrasCi was uscd ^ an mtorna i 

to discriminate between this and the genomic-BCK normalise for differences in transfection effi- 

sequence, only a subset could discriminate agains the » c°n«oi transfecled by the calcium phosphate 

c-ABLsequencewbich,atthe^ Shat on m hod and CAT assays performed as 

exon 2, has an 8/9 Lcribed (Sanehez-Gareia et al., 1993 EMBO J. 12, 

target sequence (Hermans el al, 1987). Sequencing of he 4M3 _ 42Sfl J plasmid GSEC which has tive consensus 

^^^^"^^^^^^ m n.m« GAI.4-binding P sites upstream from the minimal 
one with the composition 1 A-2A-3B and the other with x _ adenovirus Elb TATA box, and P M1VP16 

1B-2A-3B. Thus both pep , des ^carried the thud finger (3B) P™n ^ 

^^«5«^^^^S^£ ^ A SgdomainofGAL4andtheactivationdomainof 

Ve peptide 1A-2A-3B, hencefortl ireferrec 1 to as the 35 flgj-J, * ™ ^cotife^d with a 
^SS^t^^S^SSSi CATreporterpl^dandananti-BCR-ABI^.expres- 

JffiSSS ^^V^2i£Ll ^StfSSU increase in CAT activity was 

agaTst the similar sequences found in genomic BCR and <o obse ^ n „^^^^ 

C-ABL DNA, by factors greater than an order of magnitude mg copies of the p 190* cD NA target 1 "^oomparad 

(FiaS) ^^e?errLgtoFia8,(wMchmustratesdiscrimina- to a barely detectable increase ^ celk cotra^ected «Jh 

ion in the binding, of the anti-BCR-ABL peptide to its reporter plasmid bearing copies of cither the BCR or c-ABL 

^c^T^c and to like regions of genomic BCR semihomologous sequences, ind.cat.ng m vivo bmdmg. 

and c-ABL) the graph shows binding (measured as an 45 The selective stimulation of transcription mdicates con- 

A „ «„) at various [DNA]. Binding reactions and complex vincingly that highly site-specific DNA-binding can occur m 

detection by enzyme immunoassay were performed as vivo. However, while transient transfections assay binding 

described previously, and a full curve analysis was used in to plasmid DNA the true target site for this and most other 

calculations of the IC/Choo & Klug 1993). The DNA used DNA-binding proteins is m genomic DNA This might well 

were oligonucleotides spanning 9 bp either side of the fusion 50 present significant problems, not least since this DNA is 

point in the cDNA or the exon boundaries. The anti-BCR- physically separated from the cytosol by the nuclear 

ABL peptide binds to its intended target site with a membrane, but also since it may be packaged w.lhm cbro- 

Kj-6 2*0 4xlO~ 7 M, and is able to discriminate against matin. 

genomic BCR and c-ABL sequences, though the latter To study whether genomic targeting is possible, a con- 
differs by only one base pair in the bound 9 bp region. The 55 struct was made in which the anti-BCR-ABL peptide was 
measured dissociation constant is higher than that of three- flanked at the N-terminus with the nuclear localisation signal 
finger peptides from naturally occurring proteins such as Spl from the large Tandgen of SV40 virus (Kalderon el al., 1984 
(Kadonga et al., 1987 Cell 51, 1079-1090) or Zif268 Cell 499-509), and at the C-terminus with an 11 ammo acid 
(Christy el al., 1988), which have in the range of 10" 9 M, c-myc epilope tag recognisable by the 9E10 antibody (Evan 
but rather is comparable to that of the two fingers from the so et al., 1985 Mol. Cell. Biol. 5, 3610-3616). This construct 
tramtrack (ttk) protein (FairaU et al., 1992). However, the was used to transiendy transfect the IL-3-dependent murine 
affinity of the anti-BCR-ABL peptide could be refined, if cell line Ba/F3 (Palacios & Stcinmctz 1985 Cell 41, 
desired, by site-directed mutations or by "affinity malura- 727-734), or alternadvely Ba/F3+pl90 and Ba/F3+p210 cell 
tion" of a phage display library (Hawkins et al., 1992 J. Mol. lines previously made IL-3-independent .by ^integrated plas- 
Biol. 226, 889-896). 65 mid constructs expressing either plS^r " or p210" 

Having, established DNA discrimination in vitro, the abl, respectively. Staining of the cells with the 9E10 anti- 
inventors wished to test whether the anti-BCR-ABL peptide body followed by a secondary fluorescent conjugate showed 
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efficient nuclear localisation in those cells transfected with the gel was blotted onto HYBOND-N (Amersham), 

the anti-BCR-ABL peptide. UV-cross linked and hybridized to an 32 P-labcllcd c-ABL 

The experimental details were as follows: the anti-BCR- probe. Autoradiography was for 14 h at -70° C. Loading was 

ABL expression vector was generated in the pEF-BOS monitored by reprobing the filters with a mouse b-acting 

vector (Mizushima & Shigezaku 1990), including an 11 5 cDNA. 

amino acid c-myc epitope tag (EQKLISEEDLN) SEQ ID Northern filter hybridisation analysis of Ba/F3+pl90 and 

NO: 124 at the carboxy-terminal end, recognizable by the B a/F3+p210 cell lines transfected with the anti-BCR- ABL 

9E10 antibody (Evan et al, 1985) and the nuclear localiza- expression vect or was done. When transfected with the 

tion signal PKKKRKV SEQ ID NO: 125 of the large T anti-BCR-ABL expression vector, a specific downregulation 

antigen of SV40 virus (Kalderon et al, 1984) at the amino- :u Qf ^cr-abl mRNA was ^ in Ba/F3+pl90 ceUs, while 

terminal end. Three glycine residues were introduced down- expression of p 2 i0* c *^ was unaffected in Ba/F3+p210 

stream of the nuclear localization signal as a spacer, to ce J ls 

ensure exposure of the nuclear leader from the folded ' . 

molecule. Ba/F3 cells were transfected with 25 mg of the J* ™ary. the inventors have demonstrated that a 
anti-BCR-ABL expression construct tagged with tte 9E10 15 DNA-binding protein designed to recognise aspecific DNA 
c-myc epitope as described (Sanchez^arcia & Rabbitts sequence in vitro, is active in vivo where directed to the 
1994 PrL Natl. Acad. Sci. U.S.A. in press) and protein by an appended locahsa .on signal it can bind its 
production analyzed 48 h later bv immunofluorescence- sequent m chmmoscimal DNA. Thus is found on 
labelling as follows. Cells were fixed in 4% (w/v) paraform- ° tte ™f ^VT!^, £ S ° ^ 
aldehvde for 15 min, washed in phosphate-buffered saline 20 mgnfmepe^ideblodkstheiMthof me polymera^ posing 
(PBS), and permeabilized in methanol for 2 min. After sta^ng or abomon. The use of a speafac polypeptide m this 
blocking in 10% fetal calf serum in PBS for 30 min, the case «° ^ f ra gemc sequences is reminiscent of ami- 
mouse 9E10 antibody was added. After a 30 min incubation sense ohgonuclcotide- or nbozymc- based approaches to 
at room temperature" a fluorescein isothiocyana.e (FITQ- ***** ** ST 
conjugated goat anti-mouse IgG (SIGMA) was added and „ 1993 Science 261 1004-1012). Like ant.sense 
incubated for a farther 30 min Fluorescent cells were * ohgqnucleoudes, zmc finger DNA-bindmg proteins can be 
visualized using a eonfocal scanning microscope ta.lored against genes altered by chromosomal 
( ifl ti 2O0x\ translocations, or point mutations, as well as to regulatory 
(magn ca on, xj. ™ * in sequences within genes. Also, like oligonucleotides which 
Immunofluorescence of Ba^3 +P 190 and Ba/F3 +P 210 ^ S transcription by triple helix 
cells transient y transfected ^^^^^^ » formatioD « homo p U ^ine-homo Py rimidine promoters 
vector and stained with the 9E10 antibody was done. ms ^-730) DNA-binding 
Expression and nuclear localisation of the anti-BCR- ABL ' ^ ^ bmd {Q vadous uni reg i on / outside gen es, 
peptide was observed. In f 1 ^ 1 ^^^^ but in contrast they can direct gene expression by both up- 
cells show chromatin condensation and nuclear ragmcnta- down . regu lating, the initiation of transcription when 

T^^V^T f Ta « ^?^n^r 35 fused to activation (Seipel er al, 1992 EMBO J. 11, 

Ba/F3 +P 190 cells or transfected Ba/F3 +P 210 cells. 4961^968) or repression domains (Herschbach er al., 1994 

The efficiency of transient transfcction, measured as the Nalure ^ m _ m) In auy case , by acting directly on any 

proportion of immunofiuorescent cells in the population, and . ^ Qwing f^n to a varie ty of protein 

was 15-20%. When IL-3 is withdrawn from tissue culture, site . specific DN A-binding proteins have 

a corresponding proportion ofBa/F3+ P 190 cells are found to 40 ^ tential {Q COQtrol gene express i on , and Meed to 

have reverted to factor dependence and die, whi e Ba/F3+ manipulate the geneli c material itself, in medicine and 

p210 cells are unaffected. The experimental details were as rcscarcn 
follows: cell lines Ba/F3, Ba/F3+pl90 and Ba/F3+p210 

were maintained in Dulbecco's modified Eagle's medium EXAMPLE 4 
(DMEM) supplemented with 10% fetal bovine serum. In the 45 

case of Ba/F3 cell line 10% WEHI-3B-cooditioned medium The phage display zinc finger library described in the 

was included as a source of IL-3. After the transfection with preceding examples could be considered sub-optimal in a 

the anti-BCR-ABL expression vector, cells (SxlOVml) were number of ways: 

washed twice in serum-free medium and cultured in DMEM i) the library was much smaller than the theoretical 

medium with 10% fetal bovine serum without WEHI-3 50 maximum size; 

B-conditioned medium. Percentage viability was deter- H) the flanking fingers botn recognised GCG triplets (in 

mined by trypan blue exclusion. Data are expressed as certain cascs crealing nearly symmetrical binding sites for 

means of triplicate cultures. The results are shown in graphi- the three zinc tingerSj wmcrj enables the peptide to bind to 

cal form in FIG. 9. tne bottom' strand of DNA, thus evading the register of 

Immunofluorescence microscopy of transfected Ba/F3+ 55 interactions we wished to set); 

P 190 cells in the absence of IL-3 shows chromatin conden- 2 of fi ^ CAsp++r) was dominant over 

sation and nuclear fragmentation into small apoptotic the interactions of finger two (position+6) with the 5' base of 

bodies, while the nuclei ot Ba/F3+p210 cells remain intact. ^ middle triplct . 

Northern blots of total cytoplasmic RNA from Ba/F3+pl90 . x „ 1 j • , 
cells transiently transfected with the anti-BCR-ABL peptide 60 ") 00t ^ amin0 acids wcre represented in the randomised 

revealed reduced levels of v 19(f CR ' ABL mRNA relative to positions. 

untransfected cells. By contrast, similarly transfected In order to overcome these problems a new three-finger 

Ba/F3+p210 cells showed no decrease in the levels of library was created in which: 

p210 BCJt ' ABL mRNA (FIG. 12). The blots were performed as a) the middle finger is fully randomised in only four 
follows: 10 mg of total cytoplasmic RNA, from the cells 65 positions (-1, +2, +3 and +6) so that the library size is 

indicated, was glyoxylated and fractionated in 1.4% agarose smaller and all codons are represented. The library was 

gels in 10 mM NaP0 4 buffer, pH 7.0. After electrophoresis cloned in the p CANTAB 5 E phagemid vector from 
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Pharmacia, which allows higher transformation frequencies K d measurements show that the clone with Asn+3 nev- 

than the phage. erlheless binds the mutant G1.2V sequence with a K d in the 

b) the first and third fingers recognise the triplets GAC n M range and discriminates against the wild-type ras 

and GCA respectively, making for a highly asymmetric ^ However it was dicted ^ ^ 3 should 

binding ate. Recognition of the 3' A in the latter triplet by 5 . . .... 

finger three is mediated by Gln-l/Ala+2, the significance of s P eci£ y an ademne residue at the middle P 0Sltl0n > whereas 

which is that the short A1a+2 should not make contacts 10 the the polypeptide we wished to make should specify a 

DNA (in particular with the 5' base of the middle triplet), cytosine for oiptimal binding, 
thus alleviating the problem noted at (iii) above. 

EXAMPLE 5 10 xhus we assembled a thrce-fingcr peptide with a Scr at 

The human ras gene is susceptible to a number of different position +3 of Finger 1 (as shown in FIG. 12), again for 

mutations, which can convert it into an oncogene. A ras using synthetic oligos. This time the gene was ligated to 

oncogene is found in a large number of human cancers. One pCANTAB5E phagemid. Transformants were isolated in the 

particular mutation is known as the G12V mutation (i.e. the J5 K coii ABLE-C strain (from Stratagene) and grown at 30° 

polypeptide encoded by the mutant gene contains a substi- c ? which strain under these conditions reduces the copy 

tution from glycine to valine) Because ras oncogenes are so numbcr of lasmids ^ as tQ makc ^ toxic ducts less 

common m human cancers, they are extremely significant abundant in the cells 
targets for potential therapeutic methods. 

A three finger protein has been designed which can 20 

recognise the G12V mutant of ras. The protein wjk produced ^ amino acjd w (s ID Nq lg) of ^ fi 

usmg rational design based on the known specificity rules. h shown ^ nG n ^ rcfcr ^ * 

In outline, a zinc finger framework (from one of the fingers . . * ^ ~ /T , A ™ Q . . , 

selected to bind GCC) was modified by point mutatioas in ™ 8C1 f f ldu f ' ™ C & ^ & ra > blnd to the 

position +3 to yield fingers recognising two additional „ G12V mutant nucleot.de sequence: 

different triplets. The finger recognising GCC and the two 5 , GAC GGC Gcc , 

derivatives were cloned in pCANTABSE and expressed on "^J "JJ 
the surface of phage. 

Originally, the G12V-binding peptide "r-BP" was to be 
selected from a small library of related proteins. The reason 30 

a library was to be used is that while it was clear to us what ^ e A shows the single point mutation by which the 

8/9 of the amino acid:base contacts should be, it was not G12V sequence differs from the wild type sequence, 
clear whether the middle C of the GCC triplet should be 
recognised by +3 Asp, or Glu, or Scr, or Thr (sec Table 2 

above). Thus a three-finger peptide gene was assembled 35 Assay of the protein in eukaryotes (e.g. to drive CAT 

from 8 overlapping synthetic oligonucleotides which were reporter production) requires the use of a weak promoter, 

annealed and ligated according to standard procedures and When expression of the anti-RAS (G12V) protein is strong, 

the -300 bp product purified from a 2% agarose gel. The the peptide presumably binds to the wild-type ras allele 

gene for finger 1 contained a partial codon randomisation at (which is required) leading to cell death. For this reason, a 

position +3 which allowed for inclusion of each of the above 40 regulatable promoter (e.g. for tetracycline) will be used to 

amino acids (D, E, S & T) and also certain other residues deliver the protein in therapeutic applications, so that protein 

which were in fact not predicted to be desirable (e.g. Asn). excccds the K d for the G12V point mutated gene but not the 

The synthetic oligonucleotides were designed to have Sfil K d for the wild-type allele. Since the G12V mutation is a 

and NotI overhangs when annealed The -300 bp fragment naturally occurri ic mMan (not on , a cDNA 

was ligated into Sfil/Not -cut FdSN vector and the ligation 45 mutation as was thc 190 bcr . abl) human ccll ^ andothcr 

mixture was electroporated into DH5a cells. Phage were ^ { modeJg caQ be ^ ^ 
produced trom these as previously described and a selection 
step carried out using the G12 V sequence (also as described) 

to eliminate phage without insert and those phage of the T ..... 4 . „ . 

librarv which bound poorly. su In addltl0 ° t0 "f 8 ™* the expression of the gene, the 

Mowing selection, a number of separate clones were pr ° tem f C ™ bt ^ l ° *!^ 0SQ the 

isolated and phage produced from these were screened by ^ ™ ^T* u ^ " ? V SF? ^J*" 

ELISA for binding to the G12V ras sequence and discrimi- * ed & n0 ™ c DN ^ w f °* 11 sho ^ therefore 

nation against the wild-type ras sequence. A number of be P 0SSlblc ' wlthout lnventlve activity, to design 

clones were able to do this, and sequencing of phage DNA 55 dl *& osXlc tor detecting (e.g. point) mutations on DNA. 

later revealed that these fell into two categories, one of ELISA-based methods should prove particularly suitable, 
which had thc amino acid Asn at thc +3 randomised position, 
and another which had two other undesirable mutations. 

The appearance of Asn at position +3 is unexpected and 11 Ls ho P ed 10 fuse lhe /inc fin g er bindin S polypeptide to 

most probably due to the fact that proteins with a cytosine- 60 an scFv fra g ment wmch bind s to the human transferrin 

specific residue at position +3 bind to some £. coll DNA receptor, which should enhance delivery to and uptake by 

sequence so tightly that they are lethal. Thus phage display human cells. The transferrin receptor is thought particularly 

selection is not always guaranteed to produce the tightest- useful but, in theory, any receptor molecule (preferably of 

binding clone, since passage through bacteria is essential to high affinity) expressed on the surface of a human target cell 

the technique, and the selected proteins may be those which 65 could act as a suitable ligand, either for a specific immuno- 

do not bind to the genome of this host if such binding is globulin or fragment, or for the receptor's natural ligand 

deleterious. fused or coupled with the zinc finger polypeptide. 
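
The discrimination tasks discussed in Examples 3 and 5 come down to
how many bases separate the intended target site from its closest
relatives. A minimal sketch of that comparison follows. It is an
illustration only, not part of the described method: the helper
names are hypothetical, the bound 9 bp region of each Example 3 site
is assumed to be the last nine bases of the ten-base sequences quoted
in the text, and the wild-type ras site is assumed to differ from the
G12V site only at the A noted above.

```python
"""Count base mismatches between zinc finger target sites (illustrative only)."""

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def mismatches(site_a: str, site_b: str) -> int:
    """Return the number of positions at which two equal-length sites differ."""
    if len(site_a) != len(site_b):
        raise ValueError("sites must have the same length")
    return sum(a != b for a, b in zip(site_a, site_b))


def reverse_complement(site: str) -> str:
    """Return the opposite DNA strand, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(site))


# Example 3: ten-base sequences as quoted in the text.
P190_SITE = "CGCAGAAGCC"  # p190 cDNA fusion point
ABL_SITE = "TCCAGAAGCC"   # c-ABL junction sequence
BCR_SITE = "CGCAGGTGAG"   # BCR junction sequence

# Assumption: the three fingers contact the last nine bases (GCA GAA GCC).
BOUND = slice(1, None)
abl_mismatches = mismatches(P190_SITE[BOUND], ABL_SITE[BOUND])
bcr_mismatches = mismatches(P190_SITE[BOUND], BCR_SITE[BOUND])

# Example 5: G12V site from the text; the wild-type site is an assumption
# (the single A replaced by C).
G12V_SITE = "GACGGCGCC"
WILD_TYPE_SITE = "GCCGGCGCC"
ras_mismatches = mismatches(G12V_SITE, WILD_TYPE_SITE)

if __name__ == "__main__":
    print("c-ABL vs p190 target:", abl_mismatches, "mismatch(es) in 9 bp")
    print("BCR vs p190 target:", bcr_mismatches, "mismatch(es) in 9 bp")
    print("wild-type vs G12V ras:", ras_mismatches, "mismatch(es) in 9 bp")
    print("G12V site, opposite strand:", reverse_complement(G12V_SITE))
```

Under these assumptions the c-ABL site differs from the p190 target
at a single position out of nine, in line with the 8/9 match noted in
Example 3, while the BCR site differs at four. The same single-base
margin separates the G12V site from the assumed wild-type ras site,
which is why the text relies on a difference in Kd rather than on
all-or-nothing binding.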



